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Related Documents 

1. For a description of the input to the NARP assembler (i.e., 
the source language) see Project GENIE Document R-32, 
Reference Manual for MRP, An Assembler for the- SDS 9^0 , 
issued 

2. For a description of the output produced by the MRP assembler 
(i.e. j the input to the loader, DDT) see Project GENIE 
Document R-g?, Format of Binary Program Input to DDT , issued 
January 26, 1967. 

3. For a description of the SDS 9*+0 and its instructions set see 
one of the following: 

a. SDS publication: SD S $k0 Computer Reference Manual , 
No. 90.06.UOA, August 1966. 

b. Project GENIE Document R-27: SDS 930 Instructions , 
issued October 11, 1966. 

k. For a description of ARPAS, the predecessor of NARP, see 
Project GENIE Document R-26, ARPAS, Reference Manual for 
T ime-Sharing Assembler for the SDS 930, issued February 2k, I967. 
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1.0 Introduction 

This document is intended for use as a companion to the 
listing of the MRP program, and thus only contains information 
not given in the comments to the program- Therefore, precise 
table formats will be found in this report, but no specification 
of input or output for subroutines, and only sketches of most 
algorithms . 

This report will probably be of most use to those who wish 
to correct bugs or to make changes or additions to MRP. It 
is not written as a primer on assembly writing, but some motivation 
for the implementation is given, so that a relatively experienced 
programmer should be able to use it with little trouble. 

Following is a short description of the over-all workings 
of MRP: When MRP is called by a user, the entire program is 
read off the drum and into core, where it stays until the assembly 
is over. The assembly is done in one pass: MRP reads an input 
file containing a source program and writes an output file 
containing relocatable binary program ready to be loaded by DDT; 
no other input/output operations are performed except for 
messages on the teletype, and perhaps a listing. As the program 
is translated, several tables are built up in core, the principal 
ones being the main table (containing literals and definitions of 
symbols and opcodes) and the string storage (containing macro 
definitions ) . 

The basic scheme for processing an instruction in the source 
language is quite simple: The opcode is looked up in the main 
table to get its value and this is merged with the value of the 
operand which is obtained by evaluating the expression in the 
operand field. The instruction so constructed is then output. 
The evaluation of the expression may involve the lookup of 
symbols in the main table to get their values. If a symbol is 
undefined, a chain of references to it is created in the object 
program, and the loader replaces the links of this chain with 
the actual value of the symbol at load time. Undefined expressions 
consisting of more than one undefined symbol are saved in a 



1-2 

tabic in core until all the symbols in them became defined, at 
which time the expressions are evaluated. 

Of course, the general scheme above doesn't account for the 
processing of directives or macros, which makes up an appreciable 
parr of the code for NARP. Refer to the listing and the following 
report for more details. 
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'.0 Organization of NARP in Core 



2.1 Core Maps 



When MP is running, the core is used as follows: 



o 



10000, 




Working storage (corresponds 
to TE'jyIP package); see below 
for details . 



Read-only code (corresponds 
to MIGC, CENTRAL, DIRECT, and 
MAC packages); see first page 
of NARP listing for organization 
of this part of core. 



See Figure 2-1 for the details of the first five blocks of core. 
The areas in these blocks are discussed below: 

pop communication cell ; This is simply cell 0, used by pops to 

transmit parameters and return address. 
_cells JL_ through 77B: These cells are not used by NARP or the 

system. 
pop links : This is cell 100B through 177B; it contains the links 

for the pops used by MRP. This area is initialized when NARP 

is started and then never altered again. 
programmed interrupt links ; These are cells 201B through 221+B; 

if programmed interrupts are used, these cells contain pointers 

oo the routines to handle the interrupts; they are not used 

or altered by MRP at present. 



.'.. .Vj/v.o ;;'ij!,r'yU'-j.r.',i, 



1010 



5000 

2250 

1000 

40 
90 

99 
25 

-» 276 
300 

150 
10240 



I DTAB and MACT 



normal terrros 



recursive temps 



recursive subroutine links 



normal subroutine links 



freeze area 



table bounds 



initial reference tables 



main table 



string storage 



expression table 



i input pointer stack 



character stack 



operand stack 



I operator stack 



pile 



input buffer 



output buffer 



6.4 

2.1 

2.1 and 6.5 

2.2 

2.2 

7.0 

2.1 

4.1 

4.1 

6.0 

4.4 
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I 4.2 

h.3 

6.5 

2.1 and 3.0 

2.1 and 3-0 
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These are subroutines that are not pure 
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procedures bin, whose instructions are altered periodically (in 

order to make the subroutines as Vent ns possible), Theue 

subroutines pack and unpack characters, 
JJTA B and MOT: These are two tobies used in processing rmcro 

act' ir.it ions. They are placed in read-vrite memory because they 

are altered. 
norr :nitcrips: These ere the temporary working cells used by I\ T ARP; 

each one is described in the 1YIEP listing. Thty are called 

"normal" to distinguish them from "recursive" temps. 
recursive temps : These are temporary working ceils used by routines 

that are called recursively (e.g., the expression ^valuator, 

■'j\ri\ } - 
recursi ve subroutine links : These are the cells where return 

addresses for subroutine calls are stored. The recursive 

subroutines are distinguished because their return addresses 

must be saved when the subroutines are called recursively. 
nornrl subrou t ine l inks: Same as recursive subroutine links but 

for non-recursive subroutines. 
freeze area : When FRJ5EZE is encountered, information about the 

various tables in NARP is stored in this area. 
ta,b.1e bound s: Pointers and upper bounds for the variable tables 

used by NARP (cvain table through pile) are stored here. See 

3AP? listing for format. 
initial reference tobies : These are tables used to make main 

cable lookups fast. 
main table : Symbols, opcodes, and literals are stored here, 
string storage: Macro bodies, macro arguments, and repeat bodies 

are stored here. 
expression table : Undefined expressions are stored here until 

they become defined. 
input poi n ter stack : The input pointer is stacked here when input 

is switched from one source to another (e.g., from an input 

file to a macro), 
character stack : Characters are stacked here during token 

recognition. 
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o pe r and s t a c k : Operands are stacked here during expression \_J 

evaluation. 
operator sta ck: Operators are stacked here during expression 

evaluation. 
pile : Recursive temps and recursive subroutine links are stored 

here when a recursive subroutine call occurs. This table has 

no pre-seb length but simply takes up all the core in the first 

five blocks that Is left over after the other tables and areas 

are defined, 
input buffer: Blocks from the input file are read into this area. 
output buffer : Blocks to be written on the output file are 

stored here. 

2 - 2 N--^P As A Pure Procedure 

MRP is coded as a pure procedure (i.e., it Is re-entrant) 
so that several users may use the same copy of MRP at the same 
time (of course, each user has his own working storage). Since 
some temps used by MRP have initial values, it is necessary to 
have some means of getting these values initialized for each user. 
In MRP this is done simply by Including an extra block on the 
drum which contains all the initial values for the initialized 
temps, as well as the pop links (see Section 7 for a description 
of what happens when MRP is started) . 

Subroutine linkage is handled as in the following example." 

SUB ZRO RETURN ADDRESS SAVED HERE "] in read-write memory 
BRU ESUB J 

ESUB < body of subroutine > 1 . , 

BRR SUB J in read ~ onl y memory 

This scheme uses no more memory cells than that using SBRM and 
SBRR and has the advantage of being faster. The only drawback 
is that one must remember to precede a subroutine name with an 
'E' when writing the subroutine. A call of the above subroutine 
looks like 

BRM SUB /*> 
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Assume that the subroutine SUB is to be added to the vjv 
package of NARP; it should be done as follow?,: 

1. Insert oho name SUB in either the macro KSTfiG or the muero 
SSTRG (depending on whether SUB will be called recursively')- 
Those two macros are the very first macros defined in the 
Dh'F package. 

2. Insert the code for the subroutine in the MAC package. Don't 
forget that the first instruction of the subroutine is labelled 
3SU3, not SUB. 

3. At the end of the MAC package insert ESUB in the list of 
external symbols. 



; ■ Input/Output 
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3-1 Input 

The input source program is read blockwise by GETBLK in 
MISC, and MRP itself does the unpacking of characters. This 
unpacking is done by so-called "altered routines" which are in the 
read-write portion of KARP because some instructions in these 
routines are altered to make the unpacking as fast as possible. 
The four unpacking routines are FILE, UNPCK, DUXI, and DIGI (see 
HARP listing for when which routine is used), but they are all 
called indirectly by ERM* INPUT., so most of the time the main part 
of HARP does not know where characters are coming from. 

Since macro calls may occur within macro calls to an 
arbitrary depth, information about where the input is coming 
from must be stacked and unstacked. This is done in the input 
pointer stack (by STKI and USTKI in MISC), which has the following 
format : 

o 9 10 23 



length 


input subroutine 


x y // ■ / / / /y 


word address 


char 

. . .... 



o 







7 8 



21 22 23 



length ; This is only applicable when the input subroutine is 

DUMI; it is the length of the part of the argument that is yet 

to be processed. 
input subroutine : This is the address of the input subroutine, i.e., 

one of FILE, UNPCK, DUMI, or DIGI. 
word address : This is the address of the word from which the next 

character will be taken when this input source is unstacked. 
char : This is the position of the next character within the 

word specified by word address; it is not applicable for DIGI. 
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Although RRM-x- INPUT delivers the next character, thiy is 
seldom what is wanted. Thus, ?7M GNLC is uzed in,-; bead because 
GNLC (in CWNT'^L) does ;;ome preprocessing; (liko weeding oul 
most character:: -> 77B) that is usually necessary no matter which 
oart of NABP is asking tor the next character. (Currently the 
only places where 3RM* INPUT is used are in GNLC and in .1CHAP 
(in MISC). Due to later changes to GNLC (to make it handle IjpB 
characters in two different ways depending on a flag) it is now 
possible to replace BRM* INPUT by BRM GNLC in the routine 1CIIAR 
(of course, other changes should be made to .LCNAR at the same time). 
?rom a logical point of view it would be wise to make these 
changes.) 

3.2 Output 

Binary program produced by NARP is output in blocks (by 
OUTB in MISC). The house-keeping needed to organize the binary 
program into blocks preceded by 3-bit codes is done by the 
routines BPB', MWC, SWC, FF, and OVAL in MISC. 
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U.O Basi'c Processes 



''"•■'• T| °ckup of Sy mbols, Opcodes, rr,& L iters!,--, ( Including format 
of main tabic) 

An entry in the main table consists of four words, representing 
a symbol, an opcode, or a literal. These entries are arranged so 
that the first two words and the address part of the third word 
are the same (as far as the lookup routine is concerned) for the 
three different types of entries (see Figures U-l, U-2, and U-3). 

As new symbols, opcodes, and literals are encountered they are 
added at the end of the main table, so if scanned linearly this 
table contains symbols, opcodes, and literals all mixed together 
in an unpredictable order. However, this table is not scanned 
linearly; its various components a.re linked together via a "table 
link," the address part of the third word of each entry. Thus, 
for example, it would be possible to link aU. symbols together 
in one chain, all opcodes together in another chain, and all 
literals together in yet a third chain. Then when a symbol 
was to be looked up, the lookup routine would only need the 
symbol in question and the address of the first symbol in the 
main table. By following the table link from this first symbol, 
all symbols in the main table could be examined. This, in fact, 
is precisely how literals are handled, i.e., all literals are 
linked together in one chain, and when a literal is to be looked 
up this chain is followed from the first literal in the table 
until either a match is found or the chain ends (in which case 
the literal being looked up is not in the table). 

Although opcodes and symbols could be looked up in the same 
way, it is much faster to organize things slightly differently. 
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Figure h 
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Format of 


a Symbol in the Main Table 







5 6 


11 12 17 18 23 


A 


B 


[CD! 


E 


F 


fm^mm, 


<V''jT!J 


£ 


rfactor 


} table link 






value 


of symbol 



0123 h 5 



9 10 



23 



O 



ABCDEF: This is the character string; forming the symbol; if it 
is not six characters long, then there is blank fill on 
the right. The shaded area is all zeroes. 

symbol is defined 

symbol is equated 



r 



symbol is to be forgotten \ These are all booleans; 1 means 

, , . . , I true. means false, 

symbol is generated ' r 

o'/mbol is external 



rfactor: This is the relocation factor for the symbol; it is 

stored as a signed, two's complement integer with the 
sign in the first bit (thus, if the third word of the 
above entry is in A, LSI! 5; RSH 19 will leave the rfactor 
in A). 

table link: This is the address of the second word of another 
symbol in the main table, a symbol with the same hash 
code as ABCDEF. 

value: If the symbol is defined, then this is its value. If 

the symbol is undefined, then the value word is interpreted 
as follows: 



1 9 


10 




23 


| „ V////////A 




Y~ 





if«r= then fi, = f> -»T"is meaningless (0) 

f> = 1 ~-> Y~is the address of the last output instruction 
referrin % to ABCDEF. 

if <r= 1 then^ is meaningless (0) and X is the address of the first 

occurence of the symbol ABCDEF in the expression table, 
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Figure 4-2: 


Format of an Ope 


:ode in the Main Table 


c > 6 


11 


1? 


37 18 23 


A 


B 




I 

j 


! 
C | D 


E F 




\// t 




4\fi\:%Mfi\x ' $ f 






i 

table lint: i 


value 


of 


opcode 



12 456 789 10 



o 



23 



ABCDEF: This is the character string forming the opcode; see 

Figure 4-1. 

cH = 0: 
^ = 1: 



^ = 0: 



a 



= 1: 
0: 

r : 



non-shift opcode ^« „,„„ „,, ,, -.oxj -> ^ 
, . _, / r see MRP Reference Manuai, Section 3-0 
shift opcode __} ' 

do not output opcode definition (the case for macros and 

pre-defined opcodes) 
output opcoae definition 

This bit is always zero so that it, iT, and i can be accessed 

together in analogy with the rfactor of a symbol. 

Class of opcode: - opersnd optional 

1 - no operand 

2 - operand required 

- instruction 

1 - directive 

2 - macro 



Type of opcode : 



Table link: This is the address of the second word of another 
opcode in the main table, an opcode with the same hash code 
as ABCDEF. 
value: the interpretation of this word depends on the type of the opcode: 
instruction: value of the instruction, 
directive: address of the routine in NARP for processing the 

directive . 
macro: address of the first word of .the macro head in string 
storage (see Section 6.5) 
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* ? * P, ure ]j~ll Fo rmat of a Literal In The Main Table 












23 






value of the . 


Literal 






r factor of 


the 


literal J 




table link 




1 )' 


V//////MA 


(* 1 



12 



9 10 



23 



value: This is the value of the literal. 

rf actor: This is the relocation factor of the literal. 

table link: This is the address of the second word of another 

literal in the main table. 
^ = 0: {? is meaningless (0). 
r>> = 1: fS is the address of the last output instruction referring 

to the literal. 
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Figure k-k : Format of an Initial Reference Table- Entry 
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Initial contents of an entry: 



«<. 



y> 




'/ 


\ 




/'V//, 


////■'///, 




, /, 


/. 


'//A 



9 10 



23 



o< : This is simply the address of the word itself. Thus, the 
first word of an initial reference table entry simply 
contains its own address. The shaded areas are zero. 



Contents of an active entry: 



o 



p 






r 







9 10 



23 



ft : Address of the second word of the last main table entry 

in a chain. 
y : Address of the second word of the first main table entry 

in a chain. 
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Namely, when a symbol is to be looked up, a function is aoplicd 
to the character string forming the symbol to yield an integer 
called the hash code for the symbol. This -jQ.ti~r.~r "a even and is 
in the range !0,t>2], and is used to access n-: vrotial reference 
table (see Figure k-k) . An initial reference ttdle is essentially 
a linear ;<;~ray of pointers to the bo-innings mid endifgt; a" 
chains in the main tabic. Thus, instead of having «.ll symbols 
on one chain, they are distributed over J>? chains, and nil the 
symbols on a r>;iven chain have the same hash code, Thac, t.he 
number of symbols that must be looked at before discovering i: 
a given symbol is in the table is reduced from n to fan the 
average) -^ , where n is the number of symbols in the table 



: -f 



(this, of course, assumes that the function distributes symbols 
evenly over the 32 possible hash codes). The lookup works as 
follows: (input is a symbol and the address of the base of an 
initial reference table.) 

1. Compute the hash code for the symbol and add it to the base 
of the initial reference table given as input. Thus, two 
words are accessed which delimit the chain on which the 
symbol must be, if it is in the table at all. 

2. Alter the table link of the last symbol in the chain so that 
it points to the symbol being looked up. (Note-: The symbol 
to be looked up is always placed at the end of the main table 
before calling the lookup routine). This essentially adds 
one more element to the end of the chain, namely the symbol 
to be looked up. 

3- Now start at the first symbol in the chain and follow the 
chain looking for a symbol equal to the symbol being looked 
up. It is guaranteed that such a symbol will be found 
because it is always the last symbol on the chain. 

k. When the symbol being searched for is found, check if it is 
the last symbol on the chain. If not, the symbol being 
looked up is in the table and has been found. On the other 
hand, if it is the last symbol, then the symbol being looked 
no is T-ot in the table. 
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5. In case the looked up symbol is not found, it is usually 
added to the table. This is done s:b.ip.ly by changing the 
initial reference -cable entry which points to the previous 
last symbol on the chain so that it now points to the symbol 
at the end of Ihe main table. In case the looked up symbol 
is i-.oz to be added to the &airi table then no notion need be 
taken (this means, in other words , that the table link of the 
last symbol on a chain may point anywhere ) . 
The lookup of an opcode works in precisely the game way except 
that a different initial reference table is used. Literals: are 
also looked up in this way, but the hash code is always y.oro and 
the initial reference table only contains one entry, (it would 
be possible to speed up the literal lookup by computing scree sort 
of hash code on the basis of tho magnitude of the literal.) 

k.2 Token Recognition 

The input to MRP is essentially a stream of characters that 
must be analyzed to find which sequences of them form symbols , 
which form numbers, etc. For the purposes of this description, 
a token is a symbol, a number, or a single non- alphanumeric 
character. The basic routine for recognizing tokens is GISTS 
(in CEOTRL). 

The logic used in GKE is displayed by means of a state table 
in Figure k- 5 . With this mechanisai, a string of characters can 
be analyzed by using an integer called the mode (or state) to 
describe the kind of string processed so far. A new character 
in the string selects a colvcan in tho table , while the current 
mode selects a row; the table entry common to the column and row 
contains the new mode. This process is repeated until a non- 
alphanumeric character is encountered, at which time the label 
indicated by the rightmost column of the table and the current 
mode is jumped to. Here is an example: (Note; Blanks have no 
meaning in this eicsrrple) 

input string: 1 K 6 B 3 A * /""""h 

mode: 2 1110-2-3 sytab 









u-a 



Figure h - t j : State Table for Token Recognition 




j j<ail otheil 

! 0,1, ... ,9 ; characters 



-3 



i 



ud 



1 










> 


jin a digit 


-3 





-1 


1 


nc ! 


l string 










i 













i 


after 


-3 


-3 


-3 


-2 


nsb | 


<digits>B 










i 


-1 










! 


after 
<digits>D 


-3 


-3 


-3 


-3 


nsd 


-2 












after 


-j 


-3 


-3 


-3 


nsbn 


-3 

! in a 


-3 


-3 


-3 


-3 


syrab 


symbol 













comment the mode is initially 2 (first character); 

symb: pack symbol; compute hash code; goto symbol exit; 

nsbn: gr>el*-8; comment gnel contains the radix for the number; 

scale :=last digit stored; chp:=chp-2; comment chp points to 
characters; goto B2; 

nsd: gnel: =10; goto Dl; 

nsb: gnei;~8; Dl: chp:»chp-l; got o El; 

ns: ,'^nel: --radix; El : scale :=0; E2: concatenate number using 

gnel as radix and checking that all digits are less than 
the radix; goto number exit; 



ud: 



goto other exit; 



c. 
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Note that In this example it la not clear that 1U6B3A is a symbol 
until the very las-c character, 'A 1 , is read. 

Although the state table could be implemented by having a 
two-dimensional array of words containing L,be new modes, it is 
less space -consuming and probably just as fast to do it in a 
more ad hoc fashion as is done in HARP. For example, v/hen an 
input character r.alects the leftmost column of the table it is 
not necessary to aso the current mode to select a row because 
the new luodc is always -3 no matter what the current mode is. 



The selection of a column is done by 



;he irrout character 



-co access a linear array of 6k words (CTAB) which contains the 
addresses of five pieces of code corresponding to the five 
columns in the table. Thus, in the above example, when the 
first character of ]1'-6B3A is read, it is used to access CTAB, 
where the label DIGIT is found. A jump to DIGIT (in CdG) is 
made, and •chore i;he current mode is looked at to determine what 
the new mode should be. 

The format of CTAB is: 



o 











^5 


9 10 




C3 


! 


r:K 


! ^ 




r 





Y 



used by statement processor (see CL00P, Section 5) 

used by EXPRI (see Section U.3) 

address of code in GNE (NOTED, BLET, DLET, DIGIT, or OTH) 



At the bottom of Figure 4-5 is a sketch of the processing 
done by GNE after a token is recognized. It should be noted 
that as characters are read by GKE they are stored in the 
character stack (pointed to by CHP) . This stack works recursively 
because GNE is a recursive subroutine. 



c 



u 
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t . j i-;xr>rer;s-ons 



CD 



A AAPP expression is essentially a string of symbols arid 
constants ccrxeoccd by operators. The main routine /o? evaluating 
expressions is ;\XPi\' (in CFiMTRL) which also chocks, their syntax 
and. the compatibility of the rfactors of ih;eir terms. 

An auxiliary routine, EXPRI, is called by EXPR to get the 
next syntactical element for EXPR. The flowchart in Figure a. 6 
gives an outline of the workings of "Dif-hl. Mote th\rc EXPRI uses 
a part of CTAB (see Section k.2 for for;;. at) to access another 
table, r-.ar.cly OPATD (see Figure h-7) . Although entries in 
GP»vBD describe operators and are used by EXPR \<>^a the operations 
are bo be performed, one field in OH0Q (EXPRI type) is used by 
EXPRI,, both to branch to actions within EXPRI and to be delivered 
in the A register to tell EX PR what kind of syntactical element 
was delimited. EXPRI also puts information in the operand stack 
(sorae-olmec called anstk) and in the operator stack (sonetimes 
called atstk). 

The syntax of an expression is checked by a state table 
(see Section k.2). lii this case-., the state table is implemented 
by having a two-dimensional array of words, indexed rov-wice by 
the current raode and column-wise by the type of the syntactic 
element (delivered to EXPR by EXPRI) . The state table is a 
bit more complex than for token recognition since each entry 
contains the address of a piece of code as well as new mode; 
after the mode is changed the piece of code is executed. Below 
is a pseudo-ALGOL description of the actions of the code pieces; 
refer to the NARP listing for the state table and for further 
details. 



next syntactical element 



plus or emus 

negate or binary op. 
symbol or constant 



next: EXPRI; snmode : ==new mode ; goto 
action determined by old mode 
and EXPRI type; 

pm: make ncwop a unary operator instead 

of a binary one; 
nt:bo: STACK!; .goto next; 
sc : increment anp so the value delivered 

by EXPRI is aov in -fne operand 

stack; goto next; 



Flowers r', £<,;■ b/P^X^fjn C 

r 



■>:hl) 



\ entry 

V / 



-\L. 



lookup ojfinbol in 

main ir-ble 



add sy.vibol to 
i main tattle 



\r 



; place velue of 
! symbol (may be 
"^ undefined) in 
j operand stack 



U 







i place value of i 

i 

! imrber in operand | 

I SofiCK I 



"v- 



J 



See Figure it- lit for more detail? 
of this action 



o 




_J/_ 



'. delimit a string j 

! and place its j 

' value in operand | 

! stack i 




O 



! ■" 

i using p/pvT p^rt of | 

jCTAB voril for the"othe^ 
;in the linear trr.- v ! 



Ifrora OP:.TD v ord e ,.ti,ci\ 
,£.^.'^-1.1 uy,:-e t-a;,. f>"; , ; eh i 
,'to e piece 
jEXFRT 



eoe . in 



pL&ce PC in operand 
stack er:c' pJr.ce 
iral. op in operator 
stack 



piece div. op. 
in operrtor sleek 



exit 



exit 



c 



il-TP 



c 



Ljj 



Figure k-'{: Format of OPWRT) Tabic 



I. 2 8 9 11 12 ]A I',, 19 20 23 



Kir* 



J 



o^ : rf actor of result (0-absolute, 1-found by applying operator 

to rf actors ) 

fi : decree of operator (0-binary, 1 -unary) 

y : EXFRI type (used to branch to a piece of code In EXPRI; see 

NARP listing) 

^ : rf actor of operands (0 - all absolute, l-arbitrary, 2-arbitrary 

but equal; 3-<-t least one la 
absolute) 

<f : .rcbopcode: used to distinguish the various relational operators 
(J £ : opcode: used (by EXFR) to access OFTAE when operator is applied 

to its operands 

£ : hierarchy (also called precedence) of the operator 
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Figure U-8: Format of An Entry in the Operand Stack 



O 



Entry is a place holder: 



<? 



Entry is an 'undefined symbol: 



1 of th,.' mcin Wole entry 
j for the sy-ioci 


: 


> i 



■Intry is a defined value: 



1 

! value 


as 


a s: 


.gned 


integer 


j r fact or 


as a signed 


integer 


2 

1 



Entry is a defined literal: 



address of fourth word 
of the main table entry 
for the literal 







Kbte that the third word determin. ~ what kind of entry it is. 
second word is only used for defined values. 



The 






k-lk 



Figure 4-9: Organization of the Operand and Operator Stacks 



table 
"bounds 



r ckp 

UCKP 

i 

1 AN? 

i 
■ 

/ UANP 
ATP 
UATP 



I 



I PIP 

1 

i 

L_ upip 



! 
1 




upper go end o;.* character 
! eoincer is fixed) 


S'CaCX 


r-, 




1 


' operands Jtack •oointer 




, 


J 



; upp n~ bourn of operand stac;-: *, Lhiti po^ '.cr 
j i :< V ised.) __ 

; operator stack pointer 



upper ooona oi 
is fixed) 


operator irenok 


i, this 


no inter i 
I 


pile pointer 1 





v/ v ^ 



O 



o 



operand 
stack 



ANP2 



ANPlf- 



r 



operator 
stack 



this word is never used 




~| first opera- 
nt or goes 
here 



In this figure the "oo inters AiTP and ATP indicate that the stacks are 
in the neutral state, i.e.. notrnng is t> ~^.-j; on chat u^es the stacks. 
These two pointers always point to the last active word in their respective 

stacks . 
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enael iiifjn J oq: noke novop a literal operator instead of a 

1 rub;! local ooeraoor-. 

left parentheses I Ip: increment atp «md put newop in the oocrator 



u / 



SuClC4't I7O0O n : ~cco c 



right parenthesis I rp: STAC1; deercrr„c:it atp twice -co get rid of [] 
( ] ) ! in the operator stack ; goto next ; 



exit 



ex; STACK; exit from BXPdg 



c 



Of course, the "exit from EXT>B" covers a lot of details, particu- 
larly about undefined expressions; this is discussed in the KARP 

") •; oJ »•,„ „ „,..,i .•;„ <-<.,, n j. 4 „.„, Ii h / ' c.nn ci "!'-.- tfi -n> -it ~> c ; i' 1 ,d ir-ri ll_Vi^ 

J_i £* i> m .., *-*->.~ 1;-. uOC./iOi: ^r . L r . \.i0^tr cl~ — w ^ i^Lu ,~ ^-^^ S ™ j-Uj cuaCI **lf J« 

The subroutine STACK does the actual evsf-uating, applying 
the operators In the ouorator ;jtack to the operands in the 
operand clock; . Toe compatibility of rf actors" is also checked by 
this roircine. If an operator is to be applied to an undefined 



vo lue . 



special actions mud be taken as is discussed in Section 



k.k. See the NARP listing for more comments on STACK. Figures 
4-8 and 4-9 give detail « of the two stacks involved; an entry 
in the operator stack is precisely the same as that in OPWRD 
(Figure k-'f) except that the HXPRI type is alt-rays zero. 

T.ne best way to ret an idea of how EXPP works is to try 
a few examples by kar4. Below is an example showing the changes 
in the stacks as "A+B'*[C+D]-3j" is evaluated (assume A=I, B=2 ? 
C=3, D=U-). 



O 



Ope re nd St ac k 



Operator Stack 



Comments 



( 





A 


+ 




J.* 


->r 




B 


■f 


'*' has greater 
hierarchy toon does 


A 

B 


f 


' [ ! is always simply 
stacked i/ionout eoir,- 
pnri eg hierarchies 


A 


1 





X«^ 






1+-I6 



Oosrand Stack 



Crcsrutor Stack 



Corcrr.ents 



B 
C 



C 
+ 



A 
B 
C 
D 



' ] ' poshes all operators 
out of the stack through 
'['; 7=C+D 



<D 



A 



first '-' pushes out '*' ; 
lif=B-x-[C+D] 



1 


1 
! 


i 


then '-' pushes out ' +*; 
15=A+B-x-[C+D] 


f 


- 


! 12 

1 

i 


' ; ' pushes out ' -' ; 
12=A+B*[C+D]-3 

_ i 



CD 
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o o 

^ * ^ handling Undef ined Quantities ( inc.li 'ding Xitora 1 c ) 

There are essentially three different kinds of undefined 
quantities treated by ft?, HP.- *' 

1. An expression consisting on one symbol which is undefined. 
This is usually referred to as the undefined symbol case. 

2. An expression consisting of more than one symbol (i.e., a 
symbol and an operator, several symbols and operators, etc.) 
which contains at least one undefined symbol. This is 
called the undefined expression case, although strictly 
speaking, 1. is also an undefined expression. 

3- A literal. 

U.l;-.l Undefined Symbols 



o 



The symbol is placed in the main table (if ro is not already 
there), marked as undefined. The "value" of the symbol is the 
address of the instruction for which the symbol is an operand 
(see Figure J-S--1). The second time the symbol is referred to, the 
"value" is output with the current instruction, and then the 
address of the current instruction is placed in the "value." 
Thus, a chain is created with its head stored in the main table. 
For example, if U is undefined, one might have: 

Address Sourc e Q- :f out 

40;2< ALPHA LM U IDA kp'p*- 1 

500 Alb U At® kfifa -, 

\ \ " 1 

SUB U SUB 500< 1 

LDX U LDX 600^ 1 



vat this point, the "value" of U in the main table 

is 700 r-J 



When the undefined symbol is finally defined, a special piece 
of output is generated (labfix head) which tells DDT to follow 
the chain beginning at "head," replacing each link by the current 
value of the location counter. The end of the chain occurs when 






{J 



o 






the link In an instruction js equal to the address of the instruc- 
tion. (Sous assemblers indicate the end of a chain by a zero 
link, but this cieans, however... that in cert.. " en,;;:: an ur,d<-f 3 r.ed 
symbol in the firct instruction ol* a proyra, ".-.'ill rot ho treated 
correctly.) To continue viih the rbovc exaiayUr, if U hdlA -1 
appears at location d>f/5, then 

labfix 700 

is output. hDT traces down the chain, replacing tic ii -.ho by 
the current location counter, llpp. The proofs then, loots like 



1 / k/ / ^ 


LEA 


8;:;J 


5 00 


hli) 


OV/'W 


600 


SUB 


\>p'jj 


7# 


TT\V 


80*5 


800 




-i 



c 



It io clear from the fore-going that MRP doss not handle- forward 
references completely by itself, "hut in a sense utilizes the loader 
as a sort of second pass. This scheme works because the address 
parr of Ire instruction 'with the undefined sy&bol can be used 
to hold information, for the loader, namely a lick- 

k . k , 2 Undo f in ed Exore s p io ns 

However, if the operand for an instruction is undefined and 
constats of no re ttrm a single symbol (e.g., -U, U+l) then there 
is not enough room in tho instruction to hold oil the information, 
especially cinee the undefined, expression may bo very long and 
raay contain many undefined symbols. To solve this problem, 
undefined expressions are stored in a table in core (called the 
expression table or et&b, pointed to by ETp) . When all the 
symbols in a elver stored expression -oecoac defined^ the expression 
is evaluated and output to DDT. One other cospiieation enters, 
however, namely that there it, not enou.ph room in the main table 
entry for both the head of on. undefined chain and a pointer to 
the expression table. Thus, a bit in the "value" vord is set 



^ -19 

to indicate that this word points to the expression table (see ^— ^ 

Figure U-l). The word pointed to in the expression table 

represents the occurrence of the undefined symbol In fin exoression; , 

it contains either the head of an undefined chain or else points 

to the next occurrence of the undefined symbol in a caved 

expression (see example below). 

I'D should be clear that the case of U as a single undefined 
symbol is kept completely separate fron the case of U as an 
undefined egression. The occurrence of U in an undefined 
expression does not at all effect the undefined chain that is 
output to DDT. The only difference is internal to NA3?, namely 
that the head of the chain may not be in the raain table but nay 
be at the end of the chain running through the expression table; 
it will simply take a bit longer to retrieve it than if it were 
in the main table. 

The expressions are stored by TaXFE in Polish post-fix 
form (see Figure H- 10) alon<? with the address of the instruction ._ 

where the expression occurred. When an undefined symbol that v-/ 

appears in the expression table is defined, the following action 
is taken (by the routine FOLRC, see Figure 4-13): 

1. The chain running through the expression table is followed 
and each link is replaced by a pointer to the symbol in 
the main table (it is not possible to put the actual value 
of the symbol in the expression table because defined values 
need three words in this table and there ir> only one available). 

2. Tn each expression where the symbol appears, the undefined 
symbol count is decremented, and if it becomes zero (indicating 
that all the undefinca symbols in the expression are now 
defined) the expression is evaluated and it 3 value is output 
followed by either 

fix 1^ saved lc 
or 

fix 2k saved lc 

These cause DDT to fix up the instruction at "saved lc" with ,™ x 

the last output value. ^— ' 



c 



l'_On 



Firun-e H- 10: Poraab of Saved TVnreia, lonn 



Bits b2~bi; indicate the type of the word stored is. the ei 
The shaded areas are zeroes . 



Ope rat or : 



2 2 h 5 89 11 12 I 5 J 15 19 ?0 23 



^! 



H VJ 



r 'i 



d 



i 



5 



V', ^/^ J' -5 : See OFWRD table format, Figure 4-?. 



Defined value: ,' 



1 £ 


> 4 


t 


01 


- -' i 


1 


y y / A . 

■ / / y s 


Ayy^AyyAyyAAAAA 


j 

I rfactor j 


i ! 
j value j 



G 



Undefined 
syiabol: 



12 4 5 9 10 




23 


1 " 1 ' 1 ' ■ y y y y j - ■ 
\J\A 2 yyy-yyA 1 ' 


T 


... ' 



Saav as <s~,{° , X~ in Figure k-1 except; that symbol is not 
necessarily ABCDEF, and x~does not point to the first 
occurrence of the svmbol in the ot_b. 



12 


k 5 9 10 

_ _____ 




23 


\yyy\ " 
Y//A 3 


AAA AAA 


c~a 





Pointer to 

defined 

value : 
0/ : Address of third word of a defined symbol in the main table. 



X5&— ^ 



End-expres- 
sion: 



f: 



1 


2 h *> 9 


10 




23 


\..A 
\y jp 


■■- ■■■■"" > ■". y:y < 

1 ~~ 7* j 

; j circ ;. 




loe ent 


1 


V//, 


Y/////A 




6 


i 
1 

! 
1 



oi : means that the value of the expression should be computed 
;,.od 2- 1 - 4 ,, 1 raeans mod 2?-"'' . 

f! : raear.s that loc ent is meaningless if)), I mesne it has meaning. 

udef ent: number of undefined synbols in tne exnrejeicn. 

loc ent: address of the Instruction an which the egression appeared. 

Jf : If X raa,iet,er eoahaira .,;/„« adarecc of ihie word,; then KAX' p ,2 
makes h poh'ic to the i"Lr_t wc-d ct tae ■axa.-c.s.'.ion in the etat 
(thus the vord < contains 1+ the negative oO the number of words 
in the etab entry., with b0~b2 and b3-b9 zeroed out). 



!+-21 



i'i" 



h~ T ( cqrvbirr. r a ) : 



'■x 01 .Sr.ven. 



roY-es.TLons 



Vacant area: 



k 5 



I// 



V 



9 10 



2.3 



/I 



ahor of words In 



be etab entry; after ;~n expression In 



the eU6 becomes do 
the first word of the entry ic replaced by the above word 
to aid in p;arb." r-,e collecting (.oe Gecoion 8); the sbove 
wore Is in cell/* , then i^ <■-< is the address of the first 
word of the next expression in the etab. 









\SJ 
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i. When tbe end of the expression table chain is reached, its 
contents (the head of an undefined ch>:.in) are used to output 
"labfix head" as described above in Section ^;.b.l. 



Tb\T, r n*oLe 



! 



Main 
Source Oaby.d; F:<r> re aslon Table ctr.b Address Table 



im u ^lba^J D'^^.-2i±f^J~ ~ J <-typ- GO ' r 



'?0 1-3 -U ■ LDB 5^0 <—)'f'rjT£\^J_JIslL. 



t/ 



■'////A 



L~ ti ! gfij^f To&pS""] 



66<d SUB u r*SUBhffi r': r 2-jn^,i',J '■ ",. .. _ u < - Jl ^« 

700 LDX U-rl8 | LDX 7^ < — y,' ^ ~'.\ . ? \ lQ 



Sf4 ADD U 



r 



|_ ADD 600 c— 1 Ij^nLiLLld 






flow if U is defined, say by U DATA -3 at location 9^0 then the 
following is output: 

sex;rel -1 (sets special relocation factor) 

-90 (maricefi as special relocation; value of -U) 

fix Ik ~, r 4 (points to location where -U appeared) 

91.8 (value of U+18) 

fix 3. 1 ' 7^0 (points to location where U+18 appeared) 

labfix 800 (points to start of undefined chain) 

-3 (contents of word labelled U) 
After DDT finished processing the ubove, the result in core is 

\¥f$ LEA 9# 

500 LD3 -900 

6ifi SIB $pp 

LDX 918 



80£ ADD 900 

90^ -3 
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o 



^ • h . 3 hitore 1 



is 



Q. 



The literal operator '--=•' takes a value as itc operand and 
producer una Adc.^er.g of this value. Since- literals .-.re- outrun 
at the end of the program being assu.„b.led 5 the address of a 
literal 1r. unlcnora until the end of th; progmr.i, so that reference: 
to literals are logically the same as references to undefined 
cymbals • There are two cases ; 
^- 11~.2 ^.' n . rc . s ... : -l^ on - r; follow? ng rK-- literal operator ia defined: 



l" 



n clxs c<u 



ine action of the literd operator (in bXPb) 



is to look xro the literal in the main table (aiding it to 
thin table if it is not already there) and return the fourth 
uord of the sain tabic entry as the value. Thio vord ic 
ahaocl the came as the fourth word of an entry for an undefined 
symbol, i.e., it is the head of a chain of references to the 
literal. The only exception is that 10 is aivays because 
this -road never points to the expression table. 
2 - Th9..^^ T ^J'io^ followin g tho literal operator is undefined : a"~a 

in this case the literal ic placed in the expression table ^ 

,iust as any othsr undefined &xpre scion. Wien all the synbols 
in the expression beccrre defined, it ic evaluated and processed 
p" described above in 1. and in Section h.h,2. 
When the USD directive is encountered, the literals in the 
sain table are output one by one, each preceded by a labfix so 
that DDT will fix up all the references to these literals. After 
this the expression table is scanned. Ail expression in this 
tabic except those of the form -symbol cause error messages 
because they ore expressions involving external symbols, (if DDT 
is every changed so that it can handle expressions in Polish 
fom 3 then these expressions will ail be output to DDT.) Expressions 
of the form ^symbol are external literals: they are handled by 
out putting 

labfix address of instruction referring to the literal 
followed by the equivalent of 

r\ 



h-2k 



The onl 



>nly drawback to this scheme is that every occurrence in 
tbo program of ^symbol (where symbol Is undefined) Is pieced in 
the expression table. If EXPR were changed to recogvized 
=symboI as a special cascg a less space -consuming scheme for 
processing external literals might be devised. 

h.k. k Flowcharts for Routines That H.v ndle Undefined Quantities 

The observant reader will have noted that many of the pointers 
connected with handling undefined quantities arc- similar. The 
fo 2 lowing four words Iiave essentially the same format as far as 
bits hi end bi0-o23 are concerned: 
1. The fourth word of a main table entry for an undefined symbol 



/-. 



if Tare 



k-1) , 



2. An undefined symbol word In "che expression table (Figure ^-10). 
3- An end-expression word in the expression table (Figure k— 10). 
-! . The fourth word of a main table entry for a literal (Figure ^-3). 
( )J For the first two above, b0 has a common meaning; If It is 1, then 

blO-b23 point to an undefined symbol word in the expression 
table, while If It is 0, then b!0~b23 is the head of an undefined 
chain. In case 3- and k. b!0-b23 Is also the head of an undefined 
chain (although In case 3- this chain always has only one link). 
In all four cases, if bl is 0, then blQ-b23 Is meaningless; it Is 
to be ignored. The reason this case may arise is that an undefined 
quantity may appear in a wrong context , but this is not discovered 
until entries have been made in the main table and the expression 
table. Thus, it Is easier to set a bit than to undo what has 
been done. 

Another format that is used in several places is that of an 
operator :. 

1. In the OFWRD table (Figure k-7) . 

2. In the operator stack (Section k.3)- 

3. In the expression table (Figure 1:--10). 

The foi3.ovri.r5g routine? are flowchartcd in considerable detail. 
) J See the XA.H? listing for a specification of the input to the routines. 



k-2o 

1. DSTLB: This routine defines lelel,;. >K ~^ 

2. ACGK: This routine defines equated symbols. 

3- POLKC: This is an auxiliary routir.e used by both DhFLB <vnd >, 

ASCK to process the expression table when an undefined symbol 
is defined. 






C 






c 



yip.uv,,. 2)-- 31; Flowchart of II'VZP, (in CKyy-lT. ) 



'f-?6 




look up label ana 
If nob found, add 
it to r-cin table 
wi th word 3 • - 
word k:-fi 



I Istdef used 
ivhen typing 

; symbolic address 
inerror .mess . 






taoink; — word 3 

val:=vrord k 

Istdef j-addr, 
(wordH ) 



|defined:-l 

{equated: =0 

'forget : -~fi 

jge oerat ea ;, ~i{ 1 if gnbool 1 

external : ~oxf j 

rfactor: -l(«5 if rfbool) 

value : -loccnt 

table link not disturbed 







l ^»- / 






Figure U-Il: Flowchart of D?,?L'3 (in CKiTOL ) 



vcono 



I) 



"1 



error ('I)') fr 2 ^— 




C 




"Lblnlc <f> 
(symbol previous 
ly defined? ) 




bl of val is 
(i.e. blO-b23 of 
va3 is head of ary 
undefined 
chain 



takes val as 
input and alters 
it and gives it 
also as output 








j output ^lcbfixj 
Ihead of chain) | 




W 



r f > H-1? 



u-tgure '}■■ 



Flowch art of ASCN (in filF.KCT) 




3-oox up syifihoi ;;n( 
j If not f(r«::i,i then | 
j a;j,; It to fit.* tf.u in i 
j U'.ixle with vor<i 3 : j 

j - word h : '/) i 



V>-' 



;or a s r> ion , r, ec inf; 
lit in dct'CroG 



jxtra:= exf 
libi :-- ex i : = 

ll „ „+, ... 1, 




Au 


©! j : -- 


A 




Kr 


ACTOR : 


= B 






iddr of 




word k 


of 


main 




table 


antrv for 




symool 


; t; 


vblnk: 




= word 


3; 


vol:- 




word k 








S-S~ 



\J>—'' 



c- 



Tipvxc k-iy. 



hrrt of A3CF fin DTRKC/!' "> 







(cont ' d) 



; sjer.er 


j.xoa 






i shoul 
to ol 


d be 
cl p;en 


S C o 

bit 


— 


1 y gnp 


ool 







[defined:- 1 

i equated := 1 

jfrgt:- old frgt bit 

\ generated:- f) 

| external := if old ecu bit 

i la en xtra v 

| old cxt bit 
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The previous routines deal mostly with the actions when a 
symbol is defined. In contrast, the following routines handle 



bhis -joint the 



references to undefined symbols and liberals. , 
scheme used for converting uncrfineu expressions to Polish post- 
fix should be mentioned: just before f,X?h a;vL,s en oreraior to 
its operands It cheers that the operands arc Co. Tired. If they are 
not, first the operand stack, and then the operator stack, are 
scanned (in Lhe same direction as they were built \x-o) and placed 
in the expression table. At the same time ail operands in the 
operand stack are replaced by "place-holders" (see figure ';-8); 
these act as dummy operands to keep the format of the stacks in 
order. When an expression terminator i., encountered and its 
action is taken, the undefined expression will be stored in the 
expression table in Polish post -fix. 

The following flowcharts do not cover complete routines, but 
only parts of them: 
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Action on undefined expressions 

Action on undefined symbols 
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The ~"ol loving action takes p? ace when an uroofined ^"oerand in the 
operand stack 13 to be stored In the expression tabic, the action enclosed 
In the dotted box is labelled because it is referred -co in a later 
flowchart. (The flowchart below is part of the subroutine STASH in GEIH?.l.) 
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The flowchart beiov/ is of the sfhv-oaiine J If in ChffTfL. hote 
that if the operand of the liter ai operator x„~' urci-/_r>jd then this 
subroutine is not executed; instead the literal goes in etab as 
for any other undefined expression. 
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After bhc J itoral 3 ir. the main table a<\: output., the exp/vc^ion 
tdblo is scanned. A?;l expressions :n thh: t.ole .'ire oh the ferai "-X", 
whore X is cl single undefined symbol arc oua a;d; ?.:; literals; all 
other egressions in the table cause error rr.c cso^'^t, . The following 
routine is executed for ecch symbol in the main table (this is 
the routine ETEROC in DIRECT). 
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^ • ^ S tatement processor (centra?, toon of T V.; ■ ? ) , , j o n t ■n.' c L • '. on s , 
a nd n en- macro direct:' ves 

The central loop of NAKP start;, with the .hah^l C1GGP at the 
beginning of CKUTRL. Follow.: ng this label is ^ sLuuo UftJe (hlAB) 
of the same form as the one in FXfP^ which cheeks tie syntax of 
statements. In particular, trie opcode of a staicrwnt .lo loolo.-d up 
and sub sequent action is dependent on its type , 'fee central loop 
processes a statement up to the operand field nr.d then jumps to the 
appropriate action depending on the type of opcode (more details 
below). The routine jumped to then processes the operand field 
and scons up to a serai-colon or carriage return and then jumps 
back to CIGOP so that the r.oxt Gtc.terr.erit can be processed. 

Just before the ccnural loop juieps to a rcaalueg it sets A 
as follows : 

A-0 means there is no operand (if tee opcode requires 
an operand then an error message is erven \~sf the 
central, loop ) . 
A--1 means there is an operand waiting to be processed. 
The routines jumped to take the following actions; 

In s t ru. c t i on s : The code for process ins instructions is just 
after the central loop. This is rather straight- forward code, 
first checking for '<—' and '/' and then calling EXPR to get the 
value of the operand. If there is a tag then EXPR is called again. 
The instruction is built up in SNSTR and when a statement termin- 
ator is encountered it is output. (When the instruction processor 
is first entered, IKSTR already contains the value of the opcode , 
possibly an indirect bit, etc., so only the values from the operand 
field need be merged in to complete the instruction.) 

Macros ; If the opcode is a macro call then a jump to MACALL 
in MAC is made; INSTR contains the address of the first word of the 
macro head (see figure 6-2) in string storage. More details of 
the processing of macro calls are given In section 6. 

Directives : If the opcode is a directive then an ad hoc 
routine written specifically for the directive 'is jumped to. 
These routines arc divided Into non-macro directives (in Pl'liCT) 
and macro directives 'in yf.C)_, although some of the routines in 
MAC are not specifically connected with macros (lite I?) but 



V_. 



1/ ir.vol've siring processing. Tho roui'ini s In hA'J avc descr-" i .jCi i..:) 

^_„y seel. ;or, o. 'J"''\';o ; n IhihUCT arc kio'-c or less straight- forward > and 

only o. feu him monvrLoncd below, I'.oe the WAP.? listing J ° 0:r tVi0 others. 
C0"- T I: Tnis is reoihly a simple rout lac-., but Is mentioned 
becau.sc it uses a so- call led shorn W-d.e p-pn^ rated by the 
macro SIH'AB). Sec the description of the -onrine SHTLK (in 
KISC) in the A r AR? list Imp; for the format of these kinds of table 
TeH): This directive is a little more complex than the others 
in DIRECT. There are two actions depend hit; on whether WARP 
is being initialized (see section j) or whether it is a 
normal assembly. In the latter Case., several scans of the 
main table are made to output all the tables and other 
information needed by DDT. 

?(1J: This directive is of interest praxiarf".;/ because the code 
for handling equated symbols (routine ASCI, see ligure '4-12) 
is located here . 

0?D : The code may seem a bit obscure., but if it is relieved 
l) with figure k-2 at hand it should make sense. The action is 

{^J slightly different if 3MRP is being initialised (see section ?)• 
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ignored j.iiu.l„l o;io Ox one p-^uccrns in- , .L.ao_ , , ^^jdj. o. - 



^ ' • ^ C pi ' (j •' t i on • d! y r. s e mbl v an d rn .' v - ^ o s 

6 • ' L Ir statements (IF, EhS?, ELSE, E^A"') 

Processing if statements Is prlnu.ri.Iy o. matter of deciding whex 
to assemble NAEP statements and when to ship then, The shlpoing 
is the central problem since it involves string herdlirg, i.e., the 
statements to be skipped arc looked anon as strie g; f.u.t arc to be 

t-nv O'" 1 '" 

appears in an appropriate place within a string. The skipping is 
done by a routine called PACKET (described in more detail in 
section 6.\') which reads characters and switcher, on them to 
various actions. The usual action is to loon lock and read the 
next ehsracAr and switch on it. hloucver, it ;.s obvious that some 
of the actions must do a bit core o^ the TEA? closing off the 17 
will never be seen. Thus co.to actions change a mode., EMOBlh, which 
is usee to defect the opcode field of a statement, . 'fee logic is 
rather simple, the first occurrence of a blank sets the node to 
"after label field blank", and an "I" or an "E" encountered in 
this node causes C"S£ to be called to delimit the symbol in the 

opcode field. After GEE returns > the delimited symbol is looked V„y 

up in a short table containing the following: 
If? 

ELSF 
ELSE 
EILF 
If the symbol is found in this table then a special action is 
taken., otherwise PACKET goes on switching on characters. The 
advantage of this scheme should be obvious: it is very fust since 
most of the tune characters are simply skipped add GEE is called 
only rarely. 

Following is the algorithm for processing if statements 3 written 
in pseudo- ALGOL. 

•procedure ifskp (elsf, c-2.se, enSf); Irbel elsf, else-, endf; 

begin cowe-nt This procedure skips statements until it 

encounters one with ELSF., ELSE,, or Eh"DF in the opcode field. 

It then exits to the corresponding label given as input. Only 

ELSF 5 ELSE;, and EILDF on the top level are conrieered., since k.y' 

all other occurrences belong to lover level IF-EEAP blocks 

(which are to be skipped); mlev:=0; comment mlev keeps track 
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^y 
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*>-?. 



of whether wo are inside a lower lev J j. r1 --.TM r ."C" K< nch 
loop: EACX1T; cowont This coll will i. 
until one occurs vrlth "I" or "E" as the i 
of" the o-o cede field.: Odd:; Zoo. run ' " . 



"bran en \U">. , clt,^;, el;;ox, ond.!:, loo;;}; £c hi? la --" encode 

is not one of the four special opcodes., flu ".-ranch Ik co Hoop; 

i,;"x: mlev: klIcv +1; .£kltL2 loop; 

clsfx: Ed>to Lf o.lcv --0 thon drd dbfe loon; 

olsex: doth _if r! .~v -0 then -,lso oj ge loo;;; 

eiufx: rnlev: -- :ddcv -1; : joto :_£ ulev < dl_d2 ^"^ '- ' r '■' loop 

udl procedure Ifsho; 

£dilddldi. - L ' he above procedure is no: an exact replica od ZirSKP 

in flYR? since it shows, soa-e of t^c actions /.'hich really 

belong to PACKI'l., but it conveys the •.v:.In Ideas. Fa' lowing 

is the code for processing the directives Up ILLS?, ELSE,, and 

EEL? . ; 

IF: .".rent: - ifcrd -1; coa unb ifcrd is initially p> so 

when it has a negative value an id sfcaf orient Is being 

processed • 
ifl: evaluate expression in operand field; 

J_V valve of cxnression > thpn goto CLOOP; oom/riont 

Thus the code 5 ."mediately following the IF statement 

will be processed. Mote that the only thing which 

indicates that we are inside an if body is that 

ifent < 0.; 

ifskp (ifl, CLOOP, eralfl); 

comment ELSF . ELSE . S'dBF ; 
ELSE: 
ELSE: if ifent = the n brain error ('S' ); goto CLOOP; comncnt 

outside if body erd; 
o3sel : ifshp (ELS?, elsel, erdf); 

comme nt ELSE ELSE EXDP ; 

EMEF: 2zJL ' '" crit "° ii2idl Illidaa error ('S'); ZtS-Si CLCOP; ct<m- : ^ 

oaf.-, I do if body oici; 
enclQ: :i Vent : -- "fcnl; -;-].; ^oto CLOOP; 

Although it may not seem at first as if the above code can handle 



it r.t at (.-'merit c in their entirety,, ti;o reaCa-r- a: y t, -y a JV a 
excunpJca to convince ialmnal r ttac ■'.,;{ ^ra 1',: . aa; . t' e a:t>.'<' 
may look si'^'iCj '->ut it can haaatc Jt ,t,atea' >.,._; r.j!;<..;.l to „av 
level, and yet, oa":y one word ot k/.l. - ;,' a: ' r-a'ea to toeo trae-: 
of what ij a° : '- n a on ? namely li-fVra (note thaL. -'.Teat i£ the leva, 
of the if statement be5.n^ "orocosned at any given r.o.v.'-nt ) . 
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^ - ^ Or aagizo.t lor of string storage 

Whereas If statements require only a certain amount of string 
handling the processing of repeat stater jr.; s and macros is almost 
solely concerned with si n\_\; hardl: '.3. rhip'r: "a^o", s more co> 
plicaied because strings .nest be saved in memory for later use. 
.The area of memory allctr?d for those strirgs (,-f.-\ng storage or 
ss) is organized as follows: 



nngiier oraer 
addresses 



macro be; 



</ 



free 




str^v 



storage 



repeat bodies and 
macro calls 



. ~>br / 



KO 



The b or ads of as are kept truck of by two pointers, S3? ond I.SS?. 
both of which a] ways point to a word that contains useful information. 

When a macro is defined (by MICHO and ElflM),, the macro body 
is stored in the top part {i.e., lover order addresr.es) of ss 3 
after the last macro body. SS? Is edvar.ced accordingly (OSP always 
gets larger as there Is currently no garbage collection for macro 



bodies). On the other hand, wl 



xv. . ■ o. 



reneat bod'/' or macro call (m; 



aero 



calls contain arguments which must be stored until the call Is 
complete) is stored., the strings will only be needed for a short 
time;, so they are stored at the bottom of ss. As soon as thj repeat 
terminates or the macro call is flnistccp 331? :.c restored to its 
previous value so that the repeat body or macro arguments are 
discarded. Thus B.GS? gets both larger and smaller. 
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. 3 Rovat statements (l-Tit, CPdlih Vi "l) 

handling repeat statements is done i:i ~wo francs: sroring the 
repeat body 5 and then letting lifSP oroccss tie statements of the 
repeat body as many times c^ specif ice! by the repeat heac. 

The main problem of trie first ph.. .so is to find the iifR 
matching the opening kii or GRPI. The alert recdor v.'ill notice 
that this is precisely the same problem as finding the- EliDf to 
natch an IP; the main difference is thuc the characters between are 
not skipped but are stored in ss. The routine P/CXIT (now the 
name should have some mnemonic value for the reader) is used to 
store the characters in ss and to find the i.;r/rirat _rg dfibP. 

Initially s the repeat is stored in the J over order -dc'r; ,ses 
of sc, just after the last macro "bedy^ bat a^ soon as the hNPP is 
encountered., the entire repeat is moved to the higher order 
addresses of ss and 1SS? is altered accordingly. Sec- figure 6-1 
for the precise format of the repeat in ss. Of coarse., the con- 
trolled variable -portion of the stored repeat is built up before 
PACKED is caUcd to find EKBR. (_} 

As soon as the repeat statement is all set -jp in ss as in 
figure 6-l.j the exr>r area is examined (this mcy involve the 
evaluation of an expression) to see if the repeat body is to be 
processed at .least once. If so, the current input pointer is 
stacked and then fixed up so it points to the first character of 
the repeat body. A jump to CLOOP is made and NAP? is off pro- 
cessing the repeat body. When the special endrpt character (which 
replaces the symbol "ENDR" in ss) is encountered, the following 
action takes place: 

ERPT: Use the input pointer (which contains address D; 
see figure 6-l) to locate id and thus find the 
beginning of the controlled variables. 

Work throtigh the controlled variable area 3 incre- 
menting each symbol. At the end of this process 
address C is available t so stick it in fTPdCT. 

if PIS: Use the ~Lrr.il -pointer to locate ■■■' and Pts Uric! the 
beginning of the oxpr a^oa. Use the first, charac- 
ter of this area to decide what action to take: 



o 

Vd> 



"i r ui"C r )-] : Format of n --co"A"nb boav 



A: /] 
/ ! 



exor 



* :> t77 / 7 /'■;•' 



. i. i 



/ ; 



V 



i /i/r m *< 



n 



C: 






I \ i 



repeat body 



C-: 



o^ .BS.SP 



OXO"' 



i^ controlled 
( variables 



J 



higher order 
addresses 



/( 




^ 


! .!/ / 7771 << i 


?• 


vjJZUL ^ _J 



In {'Orxvai 



char value 



-m., 



"o->n;>, o; 
c:.'j /• iCi.'.r 

r ot : 



r;imb;,. rot: nor '..". only one 
word: \ he r'-ma'.nin<' iC bits form, 
a binary Aur.'hc ■" , a - j^f,l brat. :_s 
dec. . i crib eo. ..abi to . ore rcpca~ 
boey i.s processed oroil it 
becomes 7CTO. 

ev mo: evor '" s or", y one word; 

1 ) ho a ** r rumbe'*'' a beer r ' 'i o^c 
, no* r con^ron '.^ a vnr „<..'/ „e ' 
a. o -"c :oo.v, body In p:-' ec^.scd irrm 
id o value of bnc f:'-a; corivrclioc 
Vo.rij ;lc passes In is bound (doe 
to o j; rdix -''■." roj" a ere r.u/ go 
no conx .rolled variables;. 

croo: the following characters 
are an expression Terminating In 
a carriage rexure (cxpr Is then 
a variable numb or oi" words). 



contra.!!! od ao-yhc^'rlos : This area corresponds to the increment lai/c oi « repeat stuboment; 

it nay 1)0 emooy. 

<^b : address of the fourth word of a main table entry for a symbol 
/fj : Increment to be added to the symbol Indicated "oy o{ each time repeat body Is 
processed 

-hw'Sil aPilhr Tais i" the string of characters comprising the roooat body. The last 
c. u racier of Lhe striae rs always a special endrpb character (value 20 , +B); it is always 
preceded by a carriage return or a semi- col on. 

__ v : Co.iro of wo^ds from first word of controlled variable area through last word of 
rcrneat body . Ihls count ir> used to find the controlled variables when the last character 
of the repeat body has bean processed. 

c:f : Count of words from first word of expr through b' word. 'Ibis count is used to find the 
beginning of the entire stored repeat when the controlled variables have all been 
incremented. 



/' I;3S? is shown with the value it has just after the repeat Is 
^/a~aller values during the processing of the reread body. 



stored; It may take on 



c 
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;,0'.,o . i , i .ty.',^ 



V. 



2 : p-oto EVALt'8 

SIMP: Subtract 1 from count and leave count ; n A: •"•olo SPIATI 1 - '' 

BOUKD: Check if first corn, roiled, variable is past bound 

and J cave 6 in A if so^ otherwise / <■> in A; ~oio ERPTa 

EVALU8: Evaluate expression and leave value in A; 

ERPSJi. : 12 A / p hen set up Input pointer to bc:p.n..in^ of 

repeat body (usinj SfZXl) and ->oio CHOP c? <:c change 
3CS? to point to G, ;';et old input pointer from 
stack, and £oto C100?; 

By trying an example the reader should convince h lentil 'chat this 
sche/ne handles nested repeats, as well as macro calls v/ilhin 
repeats and vice versa. The only information that need be saved 
(outside the ss) to make these nested structures work is nhe current 
input pointer (see section 3-l)« 



v.. 



6 , ■ ' 3 ! or irg strings ( PACKIT r oe i", no ) 

In the- preceding discussion the routine PACXIT has 'k\ , i aero; ioned 
several times and even described to a cert mix extent., in p<-.rt K-ular 
in section 6-]. However this deec^-i rtlon >eas ox iuntcd to the u,;c 
of PACXIT in processing if sewmomorms , whereas most of bhe features 
of PACXIT are need fo"" process ibg repeat r, I... leme.nl r-: and macro 
definitions . 

The in out to PACXIT is implicit: 

I.) The input pom;or ie positioned at the be;', inning of 

a statement. 
P.) COPT A3 contains ohe address of a short tabic. 
3.) DTAB has "been modified. 
(The last two lines above will "be clarified shortly.) When 
entered, PACKIT reads characters and d ' srx~lcb.es on them vie LrA3. 
The format or ^i entry in DTAB is snown here: 



c 



o i 



fry! 



9 10 



23 



cX 



C 



/OS 



p means eon- alphanumeric character , 1 means alphanumeric 
An integer used "oy SShSS (described in section 6.p) 
Address of a piece of code in PACKIT 

C,«n +T--o l\TA"0"O "1 •; o -i- •? rv <-r f , n I'/AO X f*rvi* -M-n o~>,-r>-; P." IT"'' : . - ---'•' o •-- Th -7 •■. 

(JcC OiiO l\iix\^ Jim. S U J. i ij, \_ Lil 1'^ i.^ y _i (Jr oliO S OC C L~ .Lv. L/„- „.' v. ^ ^ i. icu * iulo 

table is in MAC because in creatine; it, arithmetic if performed 
on labels defined in MAC. Is ^oon as HAP.? is started, hovever, 
the table is moved to alterable memory (in TXItPj because some 
entries in DTAB ere changed from time to time. 

For most characters, the piece of code indie ,,ed by <?X"is PACXM, 
which simply packs the character into string storaae, (Note: The 
basic packing routine is PACK, an alterable routine located in 
TRM? after the basic input routines: when IP is using PACKIT, the 
routine PACK is altered so that it doesn't pack any characters.) 
As described in section 6.1, a rather simple mode mechanism 
indicates whether the opcode field is being processed or not. Now, 
inputs 2.) and 3-) mentioned above can be clarified: 



i -.-p 



2.) The short tabic indicated 'by IXITAI contains tie cy. , 
which arc of inccres'i; <;o tr.'i naff."; cc.13.uyi b:.01TT: 

if state loris rvo :ai rh.na^ rts aa-vo o^f-in 



iM,b_, „n\_.-. c\j-y.y argument 

E2\lD? generated 

j'or macro definitions ,> the dua:iy argument and generated 
may he diffcr>~.t for each macro; this tabic is in 
alterable memory because it is., of course, chcy ged 
depc-r.dirp; or. the macro. 
3.) IT A3 is modified so that the first charge cere of the 

symbols in the above tables cause dispatches to special 
pieces of code: 

if . statements X eye at s t at c me j ~<t s nacro definitions 

I R M 

e c ;: 



Except for "d" and "g" the dispatch is to SPECL in PfCIlIT 
which calls CiiS (if 1:1032 is right) and dees a lockup 
in the tthle iudiw^jed by iOffAB, If the lookup is 
successfully a branch is made to a piece of code indicated 
by the table. For "d" and ',p;" s IIO.'OZ is not interrogated, 
but instead, a check is made to see if the preceding 
character was alphanumeric . If not, CZCJ is called;, etc. 
The mechanisms described above arc designed essentially to 
enable a scanning algorithm that looks at one character at a time 
to look occasionally at a wider universe >, namely a symbol. However 
there are lesser tasks to be considered: 

"($" is converted to a special character 
".&" is removed (.after changing /PTM to '■■)) 
bleats are compress^., 
lice first two tasks mentioned above arc rather st;v.igni~ forward and 
can be easily followed in the l.f{? listing. The !.„„': one how- 
ever requires some comment since the conventions about blanks bet- 
ween quotes and between parentheses are a bit complex (see the 



\~s 



<L) 



KAR? reference manual). The c..cic:lon on whni:a^ 
ur not is made by interrogating 33f.-b3,, which n-..e; 
structure : 



o-LO 



.o cranio.'; e > 
/."u; foIL"; Ov.f.r:' 







f ' 10 



cr.t 






sqb: sin;;!-, quote "boolean: 1 when betvc.r. clo;;le quotes 
dqb: double quote boolean: 1 raen between ciii rr.otes 
cut: parentheses count: > n^ans be a/ocn rare:!; .jc^s, value 

is level of nesting 
If bfbiXb i.^ zero (normal state) then CQ713o ouould be -;;vaivc, rjo 
that GNLC eonmrosscs blanks. 
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act"-'; •y'CTQ , fiT^HM, macro ed 1 i.O 



In? directive -blddf) signals the beglurctm'; of a macro def- 
inition. The syubol in the label field is stored in the main 
tabic as an encode, it:; value being the aobhess o" -,1c macro ir. 
string sio'-ogo. T.P a dummy symbol and a rorcreton svebo? '..Biicra, 
they ar stored in, b"c tab"' e Mafi.' ('•':■ viib'1 1 a.,.! thci) " D ''CI"1" is 
cabled to peek av;av the macro bony \;.\ ss aiU to fir.;; tic closier 
ibbOVi. f '.(;c fb;;in-e 1-2 dor the fdnaaod of a r>aoro as stored hi as. 

'i'ne latere:, \,\ru\ y :rl <>V a icro r/eGCOssirir, is :,iccr j ozrarej .„a. 
When a maceo in called, the aYyyeeuts dor bra.' cA .". >_ .: d^ard in 
sr> along with 'pointers to them., rhe number of" a--"gamcuts , iaa'o-ra- 
tion about generated symbols., a; d a nc'^acr to tie- •, rcicsb'.e- 
macro call ( If there is one). See firaroe 6-3 arid 6-1. A:; for 
repeat bodies, the arguments are first stor.rl tuirt a "dun* ade lest 
macro body in ss, and elder they have All been read in, a'e moved 
to the bottom of us (higher order addresses)., T'ie roucine for 
reading arguments from the source and etorieg them in ss is 
SSTMSS. It reads one character at a time and switches on bTAB 
(us inrr a different part of a 0?AB word than FACXLT). T'he logic 
of b^'eTA. i.: rather sim-ole, its main task Is to find an argument 
terminal, or; see the NARP listira. 

Once the arguments have been collected., the current input 
pointer is stacked and then altered to point to the first char- 
acter of the macro body. A jump to CLOOP is then made and NARP is 
off nroeessing tie macro, Durins this -orocossing the directives 
NARG and TCC'HR mav appear. These axe handled quite sirrpiy; 

I\ T ARG: The word ARC-ACC always points to the current macro 

call (it is p if idAH? is not expanding a macro ) 3 where the 

number of arguments is stored. 

NCER: The routine SS1\SS is called., since an operand for 

UACR looks exactly like a macro argument. Ahuc that this 

directive can also a-spear outside macros. 
Of more interest are the special characters aaai may be encountered 
during macro cxTxaision; thev arc of course stored by PACKET 
during macro definition: 



w 



d„b 
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Figure 6-2: Format of a stored Tiaero body 



■\ 



V////A 


cA 


T/lTiT 


/' 


-5 i X 



\ 



> macro head 



i h 



macro body 



y< — GbP 



macro he 



aeareos oi iourun uo?a oi a main 
v„.'..ls entry for a macro opcode. 

count o':' words used by .macro head 
and macro uo>.'/ 



'/ : base o~° g>.. r.u 
value Lc - "?, ). 



:^ymr 






Y 



increir.cn c to au,.. ■ 

- macro is active; 1 



macro is 






raco Dov ,r *. anifl iy uric a,::'irg oi c..r.",'.'r;or'. come a. 



,he macro bod/. aha 



last character of the string is always ti special endnac character (value 202B); 
it Is ajway^ preceded by a carriage return or a aami-cojo..., 



(i\ T cte: The elements '->'.. /i 5 and ^ in the macro he: ad am 
garbage collector which currently does net exist.; 



j ao be usoe oy a 



F i gar e ^ .- 3 : F- )rnat o? a call oi? a m ac^o ^~~' th no dummy a rgu ment 



A[\-jJa-0 



eve a <m", ,...mc 







The only piece of information that actually needs to bu saved in this case 
is AP.G-ACC: however, two dummy words are introduced so than the format is 
compatible with a call of a macro with a dummy argument (eee figure b-h) . 



£ 



a ciji of a macro wi •■',:; cn'-my arr' 1 ^?!-' 



•- — y 



•j BGlv ? 



pinte 



rs"N 



arauwent;: 



' " ■ 


f iJZ/u. 


on'- iv ; - ^ 


> PVyC: 


wi, n ^ 


1 


1 j/^^r— 


-* ^i * i 




^L. . 




! sav 


JO. C..>"'\ V<.C,\_' 


1 n : 


! maX 
L 


7 - 'it 

! o ^ °- ^ u ,,; 



_._" ■"porit'-;: 'Racn argument "i :-» stored as 

_h; 'i,"e ;c '/•".." ".,v a;; 'Lvcn la era. ce./rce 









; la rro^rcd 



as> 

( AL 

"\ / 

.tv 

-'- /' ; 

v .o cnclec .r ~' ;>• .rcnt : >c:h c < 
erppl sea ^•fore the ;•-'■. or.' 

one --fcarr tae ov, >or ei>>h r 

strings -.v'o stored, as ,: {) ; ". iba.^rt 

$ (ohe jabel ilield) always is stce-ea. 



ec ai 


in 




vrce - 


-,rcg 


r> -"; 


en ie> 


- ■ ,- a 


L <t 


'c i £ 


*-• r- , r 


od. 


are , 


l"CQ~~ 


t"G 


ga-Tjc 




,"> "s 



o 



■pointers : 



'ach argument is --jointed to by a pointer 5 a lo-bib character au.rcac / .~, 
which points; at the opening left ncJronthcanic' of tac argument. Abie that tlv.ro ic i^_y 
an extra pointer (pn'c r.-i ] ) which would point to argument n J 1 if 1 \'?o were ..me 
more argument . 

saved argacc : This is the previous contents of :f -htlGC . VJbor. the moero ca21 is 
comole.,ed, this value will be placed back in AhCAJC go that it points to a higher 
Hevcl macro call. 

n: This is the number of arguments supplied by the macro call (r>0). 

g mas-: : Ibis is the maximitn valkn, that a subscript of a generated symbol 
may have (l < g max; d 1023). Tnis value ic the GoS:,e as 'ij in figure 6-2. 

g bare : This ic the base value for generated symbols for this call. Thus 
the generated symbol G(2) wiUl result in Gfix., where x is a digit string 
representing the value of g base + 2. 



O- 



\^ J 



DCA-'.Cp oor-iV'o;: . .;_, vre;v 

\^ ' v . v <~* y j*/ s. V- > .. . . ... „;v.- y " .'.' '. v.- - -- - .. ' . ... * , ' ' . 

Ghee one of ihc above characters I:; cr.o.u-i...C'ri J^ ,„ . <rai.eh La 
made ;;o the ap;>r./nr in, ^ ?~ocfiee in GdCG 
DIG:) — -/ _gUi DIG 
fdlGfGh roGo DGh 

The uctjo'i: of DIG and GIGS are very ei e , a ire foiG..aGp<; cx- 

preee.ajn :! e evaluated end then converted to a apjit siri:p which 
is stores i'a an. array ri TGG?. trie crprt pointer as ca...e.nc arc 

then altered to tdx input fro.a zhiz array. Tie routine GilAC 

is ever simpler; it uses the aaved araaec (see fip..res 6-3 cud G-4) 
"co restore ARGACC to its formar value., resets GSt? so the macro 

call uic appears froa ss_, and unstaeln the Input pointer. Of 

(T j) the speciai character routines s GGh coca the aoei -«rorhp pirrtuaily 

because it must worry about all the special notations for daary 
a.r5i"ur;:erit^ ( ce tic WAR? reference manual). See figure 6-5 iGr « 
flowchart of the main action of DlIM: sottind DrfiliE to the address 
of the tiret character of the arsaiaent or piece of arfuncnt that 
is specified., aivd .settrnp; IGJiiii to toe dearth oi the character 
strire; specified. 

One fine point whctf-G be rerrtiered: Socictl.ec when one of 
the special characters is eraorntered s it should not cause a 
special action but should sirno.ly be passed on. Here is an example: 

A hACGO D 

B IGARG 

KPT (I=1,B) 

DATA 



JJK J- J 



GGDK 
EhDM 

m When the macro body is stored, the strirj "f{" ineh.de the repeat 

Jx_r body will be converted to ,^ -, vh,u-e c~6 IGG.IG7. IGup x/hea the rcacro 

is expanded- the repeat veil be encounocrec_ aiui its repeat body 



will bo cloved In Lhe bottom of zz . I},r\iiig IKJc zU>c '-,.?, , '• v'Ad be 
encountered. Ii' at thio poirt the rcvlilm; ibjg in v rccuted (i.e., 
the expression "I" is evaluated) then the renom; ivi3J be cuoraO as 

h Af {1-J > B ) 

DATA XYZ 

hAlbb 
assuming that D{2.'j - ACAb'A ObvAoucly tbis io rot vnat is WuC.ed. 
To achieve the desired resxiAc., <s is ciirxly stored otth the repeat 
body and no errrreojion evaluation A; dor.c. T' ,o Amnion of whether 
to process - < or sArp.y pass it on ( here A reay bo fAAAAA or bAA-3 
as well an DIJIaDbb") is made by interrogating rXPAT"";. ";. r.ey are 
one of the i.'o.llo\r..ng values; 

-1 norma?, (i.e., not rAor-.y; anytb by- _, 

storing a macro 

1 storing a repeat or Acinping io.Acv- 
Thc decision is made according to the table below: 



:,o •orccecc. 



s r aa c ae n c 



v J 









i'-orisj a 
nacro body 



st-orm" a 



.reiw.; o 



j inside an if 
statement 



jj._.A;u.M ; C. Aab.1 



roand ; exranrd 



saore 



So ore 



acore 



caore 



nomal 



expand 



enoonc 






store 



ztot 



store 



I expand 



Con sic" or the following string brer jor.e r.acro body: 
ADD ABC4bbF„&b{bJ-a) 



During exnaneion whea the ^ ■ 



\~ e.i_c~e .- 



.:.■: or covers eon. ane 



exprc-rsion "Id- hi" '/.111 bo evaluated. This vil! involve calls of 



W 



6-.VS 



C 



•/p-.»f 



.-,=-. A- 



or ;.r.Licn , ;,; J .on o: 



Gixbrov.'tl r.<_,; 



■:) 



U \ 



j n(e) a v c ; cji 



j 3 stc-.-;j : -- r. 



r \ : DSU?(lst,:ir«0 ! 
.f v_ 




\ i 
ap_Tr,e : -ar«;acc ( x'starg } ;1 

dtir.u.e : -ar^E-cc {istarg 

+l)-dpnte -1 



err- or ( ' S • ) I — M lstclir : =j.'stchi 



MUl-UC 



4^'' 



^ 



C-.-.7 



C 



''i^ur-p A -^t DU M; Flowchart lor clcli- 



(cor.fc'd) 



V 



d(o$e s ) dfeS^G 
d($,) d(S,c) I 
d($c>) D($e,c) J 




,'j.stcb :^.Til..-.(..3tchr^j astco) ' 



i 



V / 



V.. 



dpnte : -ar-a'aec ( rotary ),' 
I -i- fctchr; j 

jdanQe:-lr>tchr - 
I fstchr + 1 i 



FJLSTG: l^To change 
of input source. 

swryffici: switch 

input source to 
an argument . 




C 



QXT, aiul Ehlh\. however., at the ii:o;"ont >v is cnsoounlvreaj 03:.'! .' ,-; 
busy ic 1 1'ilt'lii^ a symbol (ha, first earl n" v":c'i h, "DEI''') f-'o IX?: 
is "busy evfe^aiiiat mi exre-uss ion (idse lUrd, wn d which is "ABC* 
!'-<!.' .'. iiii .; as. ' ana <VS\ jU.'.j ou r^,ci:-.:iV( I'cj'-oii^i.'../;. ;T( 
character staeh in Chi' and the operand and o sera 1 or viae, s hs 
EX?h uorh fire during recursion, "but; a" 1 ! th, v;crkiaa col""?- ef 
these ^outhsss, u.s veil as their- return ....wdecssss ..avl ir s.na,d 
somewhere. This is the function of the aide: wln.:.c\x'* ibhf 

that trust he preserved (ol course;, rosi.iee„ other a' .a .h.h ....a 
GEE rust, bo cons hh-'-ed} cr<-a -nlaecd ca the rhio by Pdlh whs kllC); 
uoor. exit fuei Ihesc -ouflaes, CEhl 'also la hISC) takes irf-v- 
i.ial ion odd the p'le nad placc^ Lt hi hie -v/reeer: ale cells, .1 is 
scheme also enables eo isirucis ."."da. E.Ebhlx d;d"0 ta ho process' d, 
sine: Thai, dhh^ ;a rid DIG are reeu-sIvo„ 

The reader should note that enouah Infornad -as la stored 
with a racro cell so that ussro cells: in.ihi.ri macro e "Is, ra-c 
handed quite naturally, The lav to this is the savi.v, "> ~~ 
ARCv.CC with the macro call; thus \ ..en one call is fini.d'ucf 
AHCelCC is restored to its value before tee call, whfeh r.ay re an. 
that at points to a higher level call. This saea sch.c-v.ie Also 
handles - : recursive macro calls,, with absolutely no nrlra raachinerv 
added. The reader shoald trv a fev examples; as an exercise band- 
assemble the follow! a™ proarain 5 observinp; how ss changes: 

A 



MACRO 


D 


TV 


D{lV ESTOP' 


DATA 


D(2) 


ELSE 




ppm 




DATA 


D(2) 


A 


'STOP 5 ,, ALPHA 


ENDR 





(L) 



EhhDM 

U 'GQ',B!hTA 

EJD 



["he generated, code is 

DATA L' 

da-ai a. 

d;a:a r 



7.0 r v i f, ! , U L z in,' , rA 






it .i j 



A;"ter Lie symbolic \vrs,-r cf .:.'' 
loaded by HiT, core Hooks lido 



xir.aoe _i. 




- - ^ '' - " -a voravr"' 
aj o_. ^ ;■ c, ;. a ~ . "r "■ n e ' j, can- 
ard inaciolized aeros 



any cnie 



At this oolnt., none of the , a-defin.,! cco-ln: {.""D 11? etc..) nr . 
in ohe radix table, encept for G?B, In order to nail a; these opcode. 
H/ill s^oaid be started at 1.:/.' (sec :..p _•.,.. 7-1 c 1 7-2 for dfoail:),, 
and a special program containing do "irl'rfons of a:i.. cde orc-a^fincd 
opcodes and pro-defined symbols sic. Id be as serfs- led. At w.; end 
of tbrs ssscr-bly 5 bloel d \.j">". contain all tie do!', llions . 

II '115 co-old ban. 'la srb- systems v/hich consisted partly of 
reas-a-ate blocks end pad?y c? reed- only biotas, core would bo 
lor oroat ice; tie sub- system ill:?.. ..over/or, since TSS 



C.J ,.. t, V 



only nancies read-only blocks it Is necessary to novo block £ 
to block .': before placing Nil? on the drun as a sub- system. lhi£ 
is done airtonaLically at ;hc end of the asse-- of tie initial- 

ization program., so core looks like 






Image II 




v .lore bloc! of wcrkino 
' a*cr;r P.o\~r contains til 
"■ ' pr s~a>~ iii.it ions . 

J \~ read-only code of ;-!AR? 



• J 
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This is the core Image that goes on tie dram r,s Lie h/Id J sub- 
system; of course all four blocks arc road- only. 

A T cv/ y when a user calls 1777hlg cone loa;e II I hro\ght Into 

memory., and 7117? II smarted ;7 lahm , •;...■ ana na.ehxe In 

block 5. This code moves block -:- ami:, no Hoi. /^ a: ,t v ,, III loop 
(so block p it; reaJ-write) and then cla:g,es the rel„bet~leg so 
bloc]; h is thrown away, how cone looks like image 1 above., 
except ahaa "block 6 contains all prcdefinitions. The snaciy.^d 
source file cr. mow he assedolcn. 

The processing of the directive .1 1,7"/ ".' is hnlmahdy tiee in. 
wath mriiakhl^dng aai smarting JXTdd?, -a..*/ I .kh7 indication arc - ram 
should c-.d ah' " 

-,,c- mi,. - ,• -' - ■"-+- 

ferer.ee beuweon the START entry and the COlllttl - .tr •■• ;" s thaa ahe 
former area move hi. eh J- 1 - to hloeh ft). Here are two nieces of 
information that mast he saved to implement IlhllhrTk the viilu.ee; of 
initialized temps (which may change furlrg one esscmef y, and mast 
therefore he restored "before tin, neat .„eeeahly) s and po;;': -as to 
the main table and string storage, ahe first piece of Information 

is saved iast once \h 11111 1-, initial isr-d;; the Second piece Is 

saved every time Hill' 7 ! aprca/na Bole pieces ofd-af; e ■ alee are 

restored every tire 1711? is a" anted or cos .eared (?<. : - tla.v, 

before the Stall or CCsTCal.' entry of hAhp 5e used, the 11:1! c:rary 

has been used,, and 1111711 has occurred at the end of the initialization 

program). 

Figures 7-1 ar.d 7-2 should gi.e cnoogh details about the 
processes described above. The file opening logic may seem a bit 
obscure j bat the idea behind It Is to first collect the names of 
all the files that are to be opened and then open ther; in the order 
they were given. Thuo.^ a user can type In the file names and 
walk away from his teletype while the files arc opened (\7nich can 
take time if the tape mast be moved) a. a. the aa:;embly is dor cm- 
The file opening code will have to be clanged somewhat to imple- 
ment a.-c description of how i~o start 7771.? ar. given in the reference 
manual . 

Ceo section 8 for some cerements ce he; alio initialization 
scheme will have to be changed to accommodate pre-defined ma: es . 
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area; 



'.n tVI 



/"" 



.'£ 



bktw j : ocl: l - 



\ do inters ("ch, /c v/^rc 
I ijav&d by t'JCkZS) 



this code 
locatec. ax 
be^i'iim/, c 

■VrTop 






'l open flic 



e 



cicgi? 



\ 



;-,b,U <■ 



zr.e input j 

.jurming cc! 

ri IXC1CCT v/iion : 

'■'^ elective ' 

l,a;jic yoii b '.^ in 

olIC ^ Xu-iij/j ^i di.Oo. j 



Vi -; 



ap-'je, 



'...;Wo 



1 



coao located. 



Lr; DltvSCQ 



ye£ 



move block f> 
to block U 
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HIT: ii^vr :>'•■■; <•>-. ki JJT to hh:^ 

dcd. '. '.< ._,'.c\ uO ..ci' ti'ah tabb .". i. )"i ._,.._'. . O. ' V ,\j ; 

ir-i\,l.'/.i".c Lbo ii'Dorr b> una o~' tie -p-Do {,■>..."-• - 1 . ■ - -h ■ ^ Jtr^i,; 1 . ; 
;;ec cclon i,i); save Initial I :-,c u *.. .,.„ i.-i be ib,. , „•,..., 

STiYRT: .v>vo b? oeh : -; to block fi; 

py.o rid of "block U; 



CC0£ 



i'i ." re t:blo pointed;: .he;, were i^-a, c by i-.bbi.Zi; 

coco ooie "cable., sctti..;, wore ■s vb all Tice:af . : o:.c to ;;c?o; 

"co zero j as well as re„.ovi:v a nara teaa over frc.'i luring 
"Co.. anae^. laea sy.iaoe^ ... aa^a ..acaxcal oncer; 

oo.,o Cblhb?; 



'-._y 



KbbbbbJ; c.vi table pointers in llilbZb area; 
rejo ObOGf; 

END; -:_; D.T.'J > C baai /aoto IbbbiKD; b„b 

chock M\ .C!'j"I* ana iii? to see if Inxxlalization proy/ex: wai: okay; 
gtzx, rid of ell blocks between block p and first red- only block; 
close all files; 
type "ir.it complete"; 
neve block ft to block b; 
brs 10; 

KRi'ISi\iD: output literals; 
brs 10; 



The following macros arc used by the above rouxines: 

saving ond res honing initialized temps: 

ITkb/E : moves the initialized tempo defined by the macro Tib 
IlblOvE : moves the initialized "ores that do not lie in a 

contiguous block but are spread throughout the working area. 

saving and restoring table pointers: 

IftiWk : moves initial reference tables s hi?., SS?, arl \-IDl-\TDk-. 
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u 1 "Irritiep , 'hi ' v r r. .id a *aefd'u' 



hollowing" ..■; <i li:,i of r_. clcL-wj 
arranged in any nam neuter ord*"-. 

progrun n .. 'jj"'"c y ",jOj_r. c „ — lo era i"a 

the onher with no an^r infer vert ion. : 

clcei:r3 filer, in HTT.0, read the ricx 1 ; e"i 






13TB. If not then aseuue it is eh., first character of a n„.-/ v-c- 
U,rarr. and cr;senbje it. 

2.) IutroGu.ce a new directive i/iii- voYdhd allow the uo.r to 
chuuyy the sine of -the dvdoj L,cod iy did?: .- .wr mile., k, ;dc, 

^i,i. t^*iJ«.*O.L v> ^ <*.v\J^ U —• L> ^., -■-.* w <-w.y „.„ C- >J> GO t^ i.a,„„ l* l ~ _™ii. u _i.l, O^ „ \v^..v./_. \ ' .... j „.. 

this, a rr.o'-c ehbora^e nan::;ae.e should he pi" into a el ,~. a oabae 
overflows co lie programme* knows vda.c- table it ■ e;e» 

3.) The hit in are symbol tabic to designate that a syrbol 
is generated is naperfiuous; the forgot bit can be used. Also, 
ASGh should interrogate CESCOL and set the generated (or f organ 
bit) accordingly, (dight now C','3) 2CJJ .1 will nee- be narked 



j 1 *i 

" o v 1 rj "*— *^ '" ("■ n j 



• r|,,- _> 



^ . ) 1'. c\npty nacro body or rep oat body will ca.ac iddd? to 
go wild. Thir, is because the routine 3X.S? alary, ouneofs to 
find a carriage .,n.n.ara or alri- colon, do correct the preblen, 
pack a carriage return in se before end. lay; ?.-lICkh ami then ehangi 
things so the input pointer points to the second;, .cot the first s 
character of a macro body or repeat body, 

5.) Throw out the directives DSC and OCT and make a more 
general directive RADIX which allows the number radio; to be set 
to 2 j 3 3 • - • j 10. 

6.) Perhaps EXPd should recognise -x, where x is a single 
undefined symbol, as a special cace .instead of yjet itiekirg .it 
in the undefined expression tabic. The reason for this is that 
every occurrence of -nn_, where x is external _, will be etue'e in 
the oxorcscion table. 
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7.) _,._ cause laihp doesn't .rep v/.v.ck of ™>on . ; A. f r A' 
symbol Is used, as a 2i!-bfr, -vAIac a:a" \::.on It is ua-o. , .;.: .. :'-:->, :.-g ' s — J 

value j all frx-xma of under Aa d chair--. ;:.ur/c b. An-;>- ,- A ?'-'• „/ 
DTVP. If ohc irc«r^!cer write j jr,:^t.:I-- " A -.J the fALlG.Aopg 
will get into trouble: 

,, ,-• rvr/r- fir:.Aac loain.g: 

AID X _ kx'V.' 07V77B 

lifl) 
The trouble ,, of couroe^ Ij thai: the ?_ofb.r.Oi,t If Ho., of" /-ff 1 
co AID beeovr.es 1.1.. 

: T. ' \0 : wou.ll 1 -j . .< .-_- o ■• -.. .vg -. ^ a 1 a. Aly 
pleasing a? It uca" witter. ;;..; -. p:„ 

9-/ Ahe action of tie rclacionat ogaia.aa-. "--''' . A. ''J'" AaiuL- 

"be ch.rja.; eo ttat their ogcraahj Co ~v lave to Ave the 3.' : 

relocation factor-, and thac era only _rc fh_ vAac„ of nhe 
operand; cenpared for ecu liny,, oat aho relet. .tier factors too. 

1Z-' i^l)=p ,v — ; 

will ho lega" <|Ar.d will he rhe rlRha thing) ao raot r \h^bhcr 
T\l) Is absolute or relocAA r..e„ At prejerA, if "A'."} lb 
relocatable then rhe error L-.02.-je.3e ''■'•'■ ie typed, AA, ccn he 
quite annoying . 

10.) lac Chf directive ha;.; prov..alore for ai raring a macro 
to be defined while processing an tnAcI-A ..ration rro.Tra.n. Thic 
should be reaioved (i.e.,, so nhnt 0?I) can only define instructions 
and directives) eince macro narr.es are,, of course, defined by MACRO. 
(fhe option exists only because of a moment of fuzzy thinking on 
the author ' s rart . ) 

11.) When pre-defined ir.acros are added to the initialization 
prograa, IdlTP will have to be changed slightly. Sight now, it is 
assuaed that there is only one "block of initialised stuff, namely 
"Diode p. Since s trine storage is in another block,, elding tsre- 
dc fined macros will create rr.ore than one block of initialised 
stuff. This aeans NARP will need more than the four blocks it now 
oceanic:-: on the drum. v -- y 
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r'-'OCJlCii'TlCJi] XI's 1 ^ 



-"i , " i -j ■'-'"ft) 



}< "|VJ) 

r -'"ic ooccdc v -v. Ill T5Cir.~; to ncr.;^ 
Xe c.jcrd ■o.ic'k^o ?X '. '.,,.:,.\ ,..oc'. 
ca"'.I' X y, V.-/.1? rX. 

Xo;iXl be coded (aoo i-ovX.-k " 1"C\ Of coovno, X rroc oXy f ar . 
ow co.Xoct n,a,cro XXXo ; -lot rcrw'; Xv "c;, ;X'- XXX' In 






.-I the input poi-.vtc." r/XX; ■j.iTto,' a -■,.--•'.-., <jo collection. X<c Com?vt 
of a rcacro oodj r.i nc (ccx f Ijxre 6-2', Xx been .sccle to /.X-;e 
garbage collection nolaXvely cso.z*r. 



